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Abstract - This manuscript contains technical results related to a particular approach 
for the design of Model Predictive Control (MPC) laws. The approach, named "gen- 
eralized" terminal state constraint, induces the recursive feasibility of the underlying 
optimization problem and recursive satisfaction of state and input constraints, and it 
can be used for both tracking MPC (i.e. when the objective is to track a given steady 
state) and economic MPC (i.e. when the objective is to minimize a cost function which 
does not necessarily attains its minimum at a steady state). It is shown that the proposed 
, technique provides, in general, a larger feasibility set with respect to existing approaches, 

O ■ given the same computational complexity. Moreover, a new receding horizon strategy 

is introduced, exploiting the generalized terminal state constraint. Under mild assump- 
tions, the new strategy is guaranteed to converge in finite time, with arbitrarily good 
accuracy, to an MPC law with an optimally-chosen terminal state constraint, while still 
enjoying a larger feasibility set. The features of the new technique are illustrated by 
three examples. 

I> 

q 

■ 1 Introduction 

o: 

Model Predictive Control (MPC, see e.g. [551 H]) is one of the few existing techniques that is able to 
cope, in a quite straightforward way, with the presence of multiple inputs and outputs, of nonlinear 
dynamics and of hard constraints on the system state, x, and input, u. In MPC, at each time step t 
the input u{t) is computed by solving a finite horizon optimal control problem (FH0CP). The cost 
function to be minimized in the FHOCP is typically the average, over a finite horizon of N < oo steps, 
of the predicted values of a stage cost function, l(x, u). The latter is chosen by the user, according to 
the goal to be achieved in the control problem at hand. In particular, there are two main classes of 
problems, giving rise to two different kinds of cost functions, respectively. In the first class, typically 
referred to as tracking MPC, the aim is to drive the system state and input to reach a given set point 
or reference trajectory. The stage cost l(x, u) employed in tracking problems is therefore related to the 
deviation of the predicted state and input trajectories from the reference ones. Most of the existing 
MPC formulations are concerned with this first class of problems, and a quite vast literature has been 
developed in the last decades [26 , addressing nominal stability and recursive feasibility [30] , as well as 
robustness analysis and robust design (see e.g. [BJ HH [T7J [UJ H3] ) ■ m tracking MPC, the typical way 
to guarantee recursive feasibility of the FHOCP, as well as asymptotic stability of the target reference 
trajectory, is the use of a suitable cost function, of a sufficiently long horizon N and/or of "stabilizing 
constraints" , like state contraction constraints [351 IS] > Lyapunov-like constraints [3D] > terminal state 
constraints [H] and terminal set constraints [2"T] . 
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The second class of problems is that of economic MPC, where the stage cost is not directly related 
to a prescribed set point or trajectory to be tracked, but it expresses a performance to be optimized. 
A typical application field, in which economic MPC is of high interest, is process control, where the 
common approach consists of two hierarchical levels: at the upper level, a desired set point, according 
to the economic objective, is computed; at the lower level, a MPC law is used to track such a set 
point. In this context, economic MPC can be regarded as an integration of these two levels into a 
single predictive controller [20] [29]. More generally, economic MPC is an attractive approach for all 
control problems where the "best" performance, from the point of view of the economic objective, is 
not attained at any steady state or periodic trajectory, and/or one wants to avoid the pre-computation 
of a trajectory to be stabilized with tracking MPC. Economic MPC has been applied in practice in 
various fields, including process control [3D1HHS1, renewable energy and energy efficiency [TO1 1521 IB] 
and transportation [22, SUE], and the literature concerned with the theoretical properties of economic 
MPC schemes is all quite recent (29] HJ [3] [101 [HI G]. In most of the existing studies, a fixed point 
(x s ,u s ) is computed that minimizes the average economic cost among all the admissible fixed points. 
Then, sufficient conditions on the FHOCP problem are derived, in order to make such a steady state 
asymptotically stable for the closed-loop system with an economic MPC law. In particular, in [TO] [2] 
a terminal state constraint is used, to force the predicted state at time step t + N to be equal to x s , 
and conditions on the economic cost function are derived, under which asymptotic stability of (x s , u s ) 
is guaranteed. In [2], an asymptotic time-average economic criterion is also introduced, in order to 
analyze the performance of economic MPC schemes. In |18j . the same time-average performance as 
[2] is considered, but no terminal state constraint is used, and sufficient conditions on the prediction 
horizon and on the cost function are derived, under which the asymptotic time-average closed-loop 
performance is "approximately optimal" , i.e. it converges to a value close to the minimal one. 
In the described context, we investigate here the use of a terminal state constraint, which we call 
"generalized" because it requires the state at time step t + N to be equal not to a specific fixed point, 
e.g. a set point to be tracked or a previously derived optimal fixed point, but to any fixed point. 
This generalized terminal state constraint can be used for the design of either tracking or economic 
MPC schemes: in this paper, we study its properties in both contexts using a unified framework. In 
particular, we show that the use of the generalized terminal state constraint yields a larger feasibility 
set, with respect to a classical terminal state constraint approach. Moreover, we propose a novel 
receding horizon algorithm that, under mild assumptions, converges in finite time, with arbitrarily 
good accuracy, to an MPC law with an optimally chosen terminal state constraint, while still retaining 
a possibly larger feasibility set. Finally, we apply the approach to three examples. It has to be noted 
that this idea had been studied previously in the literature in the context of linear systems (24] [13] and 
nonlinear ones [12[ 114] , but with different assumptions and a different approach with respect to the 
one proposed here. A discussion about such differences is beyond the scope of this manuscript and will 
be included in future works. The paper is organized as follows. The problem settings are described 
in Section [2] the generalized terminal state constraint, the related FHOCP, its receding horizon 
implementation and the recursive feasibility property are treated in Section [3] Section 0] is concerned 
with the guaranteed performance of the approach and the novel receding horizon implementation. 
Finally, examples are given in Section [5] and conclusions in Section [6] 

2 Notation and problem formulation 

We consider discrete-time system models of the form: 



where / : R n x R m — > R n , t £ Z is the discrete time variable, x(t) £ R n is the system state and 
u(t) £ R m is the input. State constraints are described by a set X £ R™, and input constraints by a 
compact set U £ M. m . Mixed state- input constraints can be also considered, but they are omitted here 
for simplicity. The values of the generic variable y at time t + j, predicted at time t, are indicated as 
y(j\t), j e N. Let I : R™ x R m -» R be a stage cost function, let N £ N, < N < oo be a prediction 
horizon, finally define the cost function J s as: 



x(t+l)=f{x{t),u(t)), 



(1) 



iV-1 




(2) 



3=0 
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where U = {u(Q\t), . . . ,u(N — l\t)} is a sequence of N predicted control inputs. Then, the following 
Finite Horizon Optimal Control Problem (FHOCP) V s (x(t)) can be formulated: 



V s (x(t)) : 

min J s (x(t),U) (3a) 
subject to 

x(j\t) = f{x(j-l\t),u(j-l\t)),j = l,...,N (3b) 

u(j\t)eJJ,Vj = 0,...,N-l (3c) 

x(j\t)eX,Vj = l,...,N (3d) 

x(0\t) = x(t) (3e) 

x(N\t)=x s , (3f) 



where x s e X is fixed and chosen, together with the associated control input u s £ U, among the 
(possibly multiple) fixed points (x, u) that minimize the stage cost I (see e.g. |10j): 

Definition 1 (Optimal fixed point) 

(x s ,u s )£&rg min Ux.u) 

a;GX,uGU 

subject to (4) 
f(x, u) — x = 0. 

Problem V s is, in general, a nonlinear program (NLP) and, under mild regularity assumptions on / 
and I, a (possibly local) minimum can be computed by using a numerical solver, indicated as A. At 
a generic time step t, we denote such a solution as U*(x(t)), and the corresponding optimal value as 
J s *(x(t)) = J s (x(t),U*(x(t))). 

Remark 1 When V s is a general NLP, the outcome (J s *, U*) of the numerical solver X is a func- 
tion of both the parameter x(t) and the starting sequence U(x(t)), with which the solver is initialized. 
However, for the sake of simplicity of notation, we drop the dependence on U(x(t)), with the conven- 
tion that, unless a starting sequence is explicitly specified, the solver X includes also an initialization 
procedure. 

Remark 2 In many works on economic MFC, it is assumed that a global solution ofV s {x{t)) (as well 
as of problem ((4]) ) can be computed, which is in general difficult to achieve for non-convex problems. 
Here, we first consider generally non-convex FHOCPs, and we will invoke convexity (or more gen- 
erally the capability to always compute a global solution) only when necessary, in particular to prove 
performance and convergence results. 

The feasibility set F s is defined as follows: 

Definition 2 (Feasibility set) 

J- s = {x : V s (x) admits a solution}. 

Let B(r,x) = {y : \\y — x\\ p < r} for some p G [1, oo). We consider the following assumption on the 
set J'": 

Assumption 1 (Non- emptiness and boundedness of the feasibility set) 

I) T s ^ 

II) 3r <oo:T s C B(r,0). 

Assumption [1] is quite general, since I) holds true if and only if the problem ^ is feasible, i.e. if there 
exists at least one fixed point that satisfies state and input constraints, and II) is either inherently 
satisfied by the FHOCP |3]), or it can be enforced in most practical applications, where typically the 
state values that are meaningful for the problem at hand are contained in a bounded set. 
In MPC, the FHOCP ([3|) is solved at each time step by updating the measure of the state variable 
x(t) according to a receding horizon (RH) strategy: 

Algorithm 1 (RH control with terminal state constraint) 
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1. (initialization) given x(0) £ F s , let t = 0, and solve the FHOCP V s (x(Q)); let U*(x(0)) be a 
solution. Apply to the system the control input u(0) = u*(0|0). Set t — 1; 

2. solve the FHOCP V s (x(t)) by initializing the solver A with U = {u*(l\t - 1), ... , u*(N - l\t - 

let U*(x(t)) be a solution; 

3. apply to the system the control input u(t) = u*(0\t); 
4- set t — t + 1 and go to 2). 

We denote the state feedback control law, implicitly defined by Algorithm 1, as u(t) = n s (x(t)), k s : 
F s — > U. In the absence of noise and model uncertainty, for any given initial state x(0) £ J- s , 
Algorithm 1 guarantees recursive feasibility at all time steps t > 0, i.e. x{t) £ F s , Vi > 0. Recursive 
feasibility is achieved by initializing the solver, at step 2) of the algorithm, with the tail of the previous 
solution and the input u s , corresponding to the steady state X s . This, in turn, guarantees recursive 
satisfaction of state and input constraints, i.e. x(t) £ X, Vt > 0, K s (x(t)) £ U, V< > 0. 
The stage cost /(•, •) is chosen according to the considered control problem. In particular, in tracking 
MPC problems, the function l(x,u) is often chosen as a quadratic function of the state and input 
tracking errors: 

l(x,u) = \\x-x s f Q + \\u-u s f R , (5) 

where \\d\\m = (y T My) 1 / 2 and Q = Q T , R = R T , Q, R >~ 0. With this choice (or, more generally, with 
any function I such that l(x, u) > 0, V(x, it) £ PxU, and such that l(x, u) = (x, u) = (x s , u s )), 

Algorithm 1 guarantees asymptotic convergence of the state and input trajectories to the optimal fixed 
point. With the addition of other, quite general assumptions on the regularity of / and I and with 
long enough horizon N, Algorithm 1 guarantees asymptotic stability of the fixed point (x s ,u s ), with 
basin of attraction F s and some robustness margin (see e.g. [21)1 [TSl [TCI [T7] ) . 

In economic MPC problems, the stage cost I is chosen according to some criterion that has to be 
minimized (or maximized), e.g. energy loss/production, fuel saving, etc.. In these cases, Algorithm 1 
still guarantees recursive feasibility and state and input constraint satisfaction, however convergence 
and stability properties are not guaranteed in general, since they depend on the properties of the 
stage cost I. Sufficient conditions for asymptotic stability of the fixed point (x s , u s ) with an economic 
stage cost have been derived in [TUJ[2]. However, in economic MPC the stability of (x s ,u s ) may be 
not relevant with respect to the control objective: in fact, while in tracking MPC the cost to be 
minimized attains its global minimum at the fixed point (x s , u s ), which can be regarded as the "best" 
operating point, in economic MPC the stage cost may not attain its minimum at any steady state, 
and a steady state solution might not be the most satisfactory operating condition for the system. 
In [2], an asymptotic time-average economic performance criterion, denoted here as Joo, has been 
introduced, defined as: 

J2l(x(t),u(t)) 

Joo = hm sup *~° . (6) 

T^roo 1+1 

The asymptotic average appears to be more suited, with respect to stabilization of (x s ,u s ), to 
represent the control objective in economic MPC problems. Clearly, in closed-loop operation the value 
of Joe is a function of the employed control law. In [2], it has been proved that: 

Joo(k s ) <l(x s ,u s ) 7 (7) 

thus showing that the use of Algorithm 1 gives an asymptotic time-average economic performance 
that is better than or equal to that of the stage cost associated to the "best" steady state. 
In both tracking and economic MPC, the use of the FHOCP ([3]) in Algorithm 1 represents a straight- 
forward way to achieve recursive feasibility and constraint satisfaction, however it is well known that 
the terminal state constraint ([3f| may be quite restrictive, so that typically quite "long" prediction 
horizons N have to be employed to achieve a satisfactorily large feasibility set !F S , with a consequent 
higher computational complexity with respect to other techniques, like dual- mode MPC [26) . In this 
paper, we adopt a particular terminal state constraint that aims to reduce this drawback, and we 
analyze the properties of the resulting closed-loop system in the case of both tracking and economic 
MPC. 
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3 Generalized terminal state constraint 



Let V = {v(0\t), . . . , v(N\t)} G l mxA,+1 be a sequence of N + 1 predicted control inputs, up to time 
t + N, let (3 e M + and > l(x s ,u s ) be two scalars, whose role will be better specified later on, and 
define the cost function J as 

AT-1 

J(x(t),V) = J2 l(x(j\t)MM) + Pl{x(N\t), v(N\t)). (8) 

3=0 

Then, we propose to replace the FHOCP ([3]) with the following: 

V(l(t),x(t)) ■■ 

min J(x(t),V) (9a) 
subject to 

x(j\t) = f(x(j - l\t), v(j - l|t)), j = 1, . . . ,N (9b) 
v{j\t)eV,Vj = 0,...,N (9c) 
x(j\t)£X,Vj = l,...,N (9d) 
x(0\t) = x(t) (9e) 
x(N\t)-f(x(N\t),v(N\t))=0 (9f) 
l(x(N\t),v(N\t)) <l(t). (9g) 

We denote a (possibly local) solution of V(l{t), x{t)) as V*(j{t),x(t)) = {v*(Q\t), . . . , v*(N\t)}, and 
the corresponding optimal value as J*(l(t), x(t)) = J(x(t), V*(l(t), x(t))). Moreover, we indicate with 
x *U\t)> J £ [0)-/V] the sequence of predicted state values, computed by using the model (JTJ) , starting 
from x*(0\t) — x(t) and applying the control sequence V*(l(t), x(t)). 

Remark 3 The FHOCP V(l(t), x(t)) has (N + l)m optimization variables, i.e. m more than problem 
V s (x(t)) ([3]). Depending on the considered application, this might or might not be an issue. This slight 
increase in the number of optimization variables can be seen as the "price" for generalizing the terminal 
state constraint ([9fj) . Moreover, we show in our examples how the use of a generalized terminal state 
constraint, with a much shorter horizon N , can yield closed-loop performance that are similar to 
those obtained with a fixed terminal state constraint and longer horizon, thus effectively reducing the 
computational effort. 

The generalized feasibility set T is defined as: 

Definition 3 (Generalized feasibility set) 

J- = {x : V(J, x) admits a solution for some I}. 

For a given x(t) £ J 7 , let us define the set X(x(i), N) as follows: 

Definition 4 (Set of reachable fixed points) 

X(x{t),N) = 

{xeX:3V e l mxM+1 : v(j\t) G U, Vj G [0, N]; 

x(N\t) = x; f(x, v(N\t)) = x; (10) 

x(j\t) = f(x(j - l\t),v(j - l\t)), Vj G [1,7V] 

a;01t)€X,Vje[l,JV]}. 

The set X(x(t),N) contains all the possible steady state values that can be reached in at most N 
steps with an admissible control sequence V , starting from x(t). It is straightforward to note that if 
N 1 > N 2 , then X(x(t),N 1 ) D X(x(t), N 2 ). The following result is concerned with the relationship 
between the sets X(x(t), N) and J- s . 

Proposition 1 Let Assumption^ hold. Then: 

x s eX(x(t),N) <^=> x{t)eT s (11) 
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Proof 1 If x s iE X(x(t), N), then by how the set X(x(t),N) is defined, there exists a sequence V of 
N+l predicted control moves, such that all of the constraints (|9bp - (|9fj) are satisfied, with x(N\t) = x" . 
Therefore, the first N elements of such a sequence satisfy also constraints (|3a[) - (j3f)) . hence x(t) £ J- s . 
Conversely, if x(t) £ T s , then there exists an optimal solution U*(x(t)) satisfying constraints (I3al) - 
(|3fp. Thus, the sequence V — {U*(x(t)), u*(N\t)} satisfies the constraints in (|10p with x(N\t) — x s , 
i.e. x s £ X(x(t),N). ' □ 

We also define the quantity l[x(i)) as: 

Definition 5 (Optimal achievable stage cost) For a given x(t) £ T, the optimal achievable stage cost 
is: 

l(x(t)) = min l(x,u) 

subject to 

(12) 

f(x,u) = X. 

Assumption 2 (Existence of the optimal achievable cost) For any x £ T , the value l(x(t)) of Defi- 
nition^ exists. 

Assumption^ holds in most practical cases, considering that the input constraint set U is compact, 
the horizon N is finite and the stage cost I can be chosen by the designer. 
We can now define the set S = {(7, x) : I > l(x), x £ J 7 }, as well as the following functions: 

K(j(t),x(t)) = v*{0\t) 

((l(t),x(t))=l(x*(N\t),v*(N\t)) (u) 

The value n(l(t),x(t)) corresponds to the first control input in the sequence V*(l(t),x(t)) and the 
value ((J(t),x(t)) is the cost associated to the terminal state-input pair, obtained by applying to sys- 
tem ([T]) the sequence V*(l(t),x(t)), starting from the initial condition x(t). 
The following RH strategy is obtained by recursively solving the FHOCP (|9|): 

Algorithm 2 (RH control with generalized terminal state constraint) 

1. (initialization) choose a value of (3 > 0. Given x(0) G T , choose a value 1(0) such that 
(7(0), x (0)) e S and let t = 0. Solve the FHOCP 7>(7(0),x(0)); let V*(l(0),x(0)) be a solu- 
tion. Apply to the system the control input u(0) = k(7(0), x{0)). Set t = 1; 

2. set l(t) = C(l(t — 1), x(t — 1)) and solve the FHOCP V(l(t), x{t)) by initializing the solver A with 
V = {v*(l\t- 1),. ..,v*(N\t- l),v*(N\t- 1)}; let V*(l{t), x{t)) be a solution; 

3. apply to the system the control input u(t) = K,(l(t),x(t)); 

4- set t = t + 1 and go to 2). 

Remark 4 From a practical point of view, we note that the value of l(x(0)) needs not to be known 
explicitly in the initialization step of Algorithm 2, when selecting 7(0) € S: in fact, by construction 
any value of 7(0) such that the problem 7-* (7(0), x(0)) is feasible belongs to S. 

The application of Algorithm 2 gives rise to the following closed-loop system: 

x(t + l) = f(x(t), K (l(t),x(t))) (14a) 

~l(t + l) = C(l(t),x(t)) (14b) 

We denote with tfj(k,l(t),x(tj) and <f>{k ,l(t) , x{t)) the values of the bound l{t + k) and of the state 
x(t + k), respectively, at the generic time t + k, k £ N, obtained by applying (fl"4"l) starting from x(t) 
and l(t). 

Our first result is concerned with the existence of J- and its relationship with the set J- s and with 
the properties of recursive feasibility of problem "P(7(t), x(t)) in Algorithm 2, hence of the capability 
of the control law k to satisfy input and state constraints. 
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Theorem 1 Let Assumption^ hold, and consider the closed-loop system (|14p . obtained by applying 
Algorithm 2 with any j3 > in the FHOCP V . The following properties hold: 

a) (feasibility set) 

JDJ S . 

b) (recursive feasibility) 

■P(i/j(t,7(0),x(0)),4>(t,l(0), x(0))) is feasible 
V(7(0),x(0)) e S, Vt > 

c) (state constraint satisfaction) 

0(U(O),x(O)) e X, V(7(0),x(0)) G S, Vt > 

d) (input constraint satisfaction) 

K(^(t,I(0),x(0)),^,I(0),x(0))) € U, V(7(0),x(0)) e 5, Vt > 0. 

Proof 2 a) By Assumption^ there exists a set T s of state values such that problem V s is feasible. 
Then, it is straightforward to note that also problem V(l(x s , u s ), x(t)) is feasible for all x(t) € J- s . 

b) For any given (7(0), x(0)) € S, by definition problem V(J(0), x(0)) admits a solution V*(7(0), x(0)). 
Clearly, the pair (x*(N\0),v*(N\0)) is a fixed point for system ([1}. According to Algorithm 2 the 
control move u(0) — u*(0|0) is applied. At t = 1, the new state value is x(l) = </>(l,7(0), x(0)) = 
/(x(0),u*(Q|t)) — x*(l\t), and the solver A is initialized with the starting sequence V, which includes 
the tail {v*(l\0), . . . , v*(N\0)} of sequence V*(Z(0),x(0)), plus the terminal input v(N\l) = w*(iV|Q). 
Let us denote with x(j|l), j £ [0,N], the corresponding sequence of predicted state values. Then, it 
can be noted that x(N- 1|1) = x(N\l) = x*{N\0), i.e. the pair (x(N\l), v(N\l)) = (x*(N\0),v*(N\0)) 
satisfies constraint J9fj). Constraints (|9b p - (|9dll are also satisfied, since the predicted state and input 
sequences are the same as the ones computed at the previous time step, plus an additional state- 
input pair (x{N\l),v(N\X)). Finally, constraint ( |9g[ ) is satisfied too, since ^(l,7(0),x(0)) = 7(1) = 
C(7(0),x(0)) = l(x*(N\0),v*(N\0)) = l(x(N\l),v(N\l)). Therefore, the sequence V is admissible for 
problem 

V(ip(l, 7(0), x(0)), 0(1,7(0), x(0))). The same reasoning applies recursively for all t>0. 

c) -d) Straightforward consequences of the recursive feasibility property. □ 

Remark 5 According to Theorem QJ the generalized feasibility set T is positively invariant for the 
trajectories 0(t, 7(0), x(0)) of system ([T]) with the feedback control law k, and it is at least as large as 
the feasibility set of problem V s . This means that, in the "worst" case, for given x(0) problem V is 
feasible if and only if V s is, but, depending on the system ([1]), it may happen that V is feasible for a 
larger set of state values, given the same prediction horizon N , or alternatively that V is similar to 
V s , but with a shorter prediction horizon, i.e. lower computational complexity. Moreover, the set S 
is also forward invariant for the values of (^/>(t,7(0), x(0)), 0(t,7(O), x(0))). 

The generalized feasibility set may be larger than the feasibility set obtained with a fixed terminal 
state constraint; however nothing can be said, in general, about the performance of the closed-loop 
system obtained by applying Algorithm 2, in terms of stability of the target steady state in tracking 
MPC, and of asymptotic average cost in economic MPC. In fact, the performance guarantees achieved 
by Algorithm 1, with a fixed terminal state constraint, are a direct consequence of the fact that the 
employed value of (x s ,u s ) has been optimally chosen off-line, according to the control problem at 
hand. On the contrary, in Algorithm 2, the terminal state and input (x* (N\t),v* (N\t)) are different, 
in general, from the values (x s ,u s ), and they are allowed to change at each time step t. Basically, 
the values of (x* N \t> v *N\t) are implicitly "selected", among all the possible steady states that can 
be reached in at most N steps from the actual state x(t), by the numerical solver A, in order to 
minimize the cost J(x, V) |5J). Therefore, for given control horizon iV and constraints X, U, the values 
(x*(N\t), v*(N\t)) depend on the chosen stage cost •) and on the scalar weight (3. Moreover, it can 
be noted that the use of Algorithm 2 gives rise to a sequence of pairs {(x*(N\t), v*(N\t))}%L , and 
consequently a sequence of terminal cost values {l(x* (N\t), v* (N\t))}^ . The performance achieved 
by the system (|14[) clearly depends on the behavior of such a sequence. In this regard, we note that 
the control law u(t) = K{t(t),x(t)), obtained by using Algorithm 2, is the output of a dynamical 
system, with internal state 7(t) and input x(t). This is in contrast with the typical MPC control laws, 
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like k s , that are static feedback controllers. The controller's state l(t) traces the value of the stage 
cost associated with the terminal state- input pair l(x*(N\t), v*(N\t)), hence it carries the information 
about how suboptimal is such a terminal cost with respect to the optimal one, l(x s , u s ). The inequality 
l(x*(N\t), v*(N\t)) < K x *N\t-i> v N\t-i) = ^(*)' en f° rce d by means of constraint ( |5g| , ensures that the 
sequence {K x N\t' v N\t)}t=o ^ s n °t increasing, however, in the general settings considered so far, there 
is no guarantee of convergence to the optimal value l(x s ,u s ), or to a value close to the optimal. As 
a consequence, no guaranteed performance properties can be obtained. One option to deal with this 
issue is to use Algorithm 2 as it is, to set some initial choices of N , I and /3 and to tune these parameters 
following a trial-and-error procedure, in order to obtain a satisfactory closed-loop performance. Indeed, 
quite good results can be typically obtained in this way, as highlighted in the examples of Section 
[SJ Another option is to consider additional assumptions on the problem, to derive guidelines on how 
to choose N, I and (3, as well as to adopt a more sophisticated receding horizon algorithm, in order 
to guarantee a desired behavior of the sequence {l(x*(N\t),v*(N\t))}%L , in terms of convergence to 
a value which is arbitrarily close to the optimal one, l(x s ,u s ). Such a modified algorithm and its 
properties are described in the next section. 

4 Guaranteed properties of MPC with generalized terminal 
state constraint 

We first establish sufficient conditions on f3 under which the terminal state and input pair (x* (N\t) , v* (N\t)) , 
computed by solving problem V(l(t),x(t)), has an associated cost l(x*(N\t), v*(N\t)) which is arbi- 
trarily close to the minimal one, among all the possible steady states that can be reached from x(t). 
In order to do so, we consider the next three assumptions. We recall that a continuous, monotonically 
increasing function a : [0, +oo) — > [0, +oo) is a /Coo function if a(0) = and lim a(a) — +oo, and 

a— v+oo 

denote such functions as a € /Coo- 

Assumption 3 (Boundedness of the generalized feasibility set and continuity of f and I) 

I) 3r < oo :Jc6(r,0); 

II) / and I are continuous on T x U, where T is the closure of J- ' , hence 3 a/, a; € /Coo -\\f(x,u) — 

f(x, u)\\ < <Xf(\\(x, u) - (x, u)\\), \l(x,u)- l(x,u)\ < ai(\\(x,u) - (x, u)\\), V(s, u), (x, i)eJxU, 
for some vector norm |j • ||. 

Similarly to the set F s , the set T in many cases is bounded in the presence of bounded state and 
input constraints, while, depending on the system equations /, it might be unbounded if the state 
constraint set X is unbounded, even with a finite horizon A, due to the generalized terminal state 
constraint (|9f|) . However, in practical applications the initial state values that are meaningful for the 
problem at hand are typically contained in a bounded set F C K ra , so that one can always consider a 
"reduced" feasibility set T = J- n F to satisfy Assumption [3J-I) . 

Assumption 4 (Solution of the FHOCP V ) 

For any (7, x) € S, the FHOCP V(l,x) has at least one global minimum, which is computed by the 
solver A independently on how it is initialized. 

As already discussed in Remark [2J Assumption 2] is quite usual in the context of economic MPC. 
Moreover, it is satisfied if problem V(J,x) is convex, which is the important case of MPC for linear 
systems with convex constraints X, U and convex stage cost I. 

Assumption 5 (Stage cost) 

There exists a set A4 C J- x U where the function I attains its minimum. Without loss of generality, 
l(x, u) > 0, V(x, u) G T x U, and l(x, u) — (x, u) e M. 

Assumption[5]is obviously satisfied for tracking MPC with stage costs like ([5]), with A4 = {(x s ,u s )}. 
In the case of economic MPC, satisfaction of Assumption [5] depends on the stage cost chosen by the 
control designer, and the set M. typically does not contain any steady state and might also be not 
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connected. 



We can now derive a result related to the optimality of the pair (x*(N\t), v*(N\t)) with respect to 
the stage cost function I. 

Proposition 2 Let Assumptions [T^ hold. Then, for any e > 0, there exists a finite value /3(e) such 
that, for any given x(t) G J- and any 1(f) > l(x(t)) + e, if ft > /3(e) then 

l(x*(N\t),v*(N\t)) <l_(x(t))+e (15) 

where (x*(N\t), v*(N\t)) are the optimal terminal state and input computed by solving problem V : (l(t) , x(t)). 

Proof 3 Let (x, u) be a state-input pair such that: 

(x.u) — are; min l(x,u) 

subject to 
f(x,u) = X, 

and let V be a sequence of N + 1 control inputs which satisfies constraints (|9b |) ~ ([9fj) and such that 
x(N\t) = x, v(N\t) = u. This sequence is guaranteed to exist by Definition^ Moreover, we have 
l{x,u) — l(x(t)) < l(t), hence also constraint ( |9g[ ) is satisfied and the sequence V is admissible for 
problem V(l(t), x(t)) . The cost associated to V is equal to: 

N-l 

where x(j\t), j G [0,iV] is the state trajectory obtained by applying the sequence V. Consider now any 
other possible state x € X(x(t), N) and input u €E U such that f(x,u) — x, l(x,u) > l(x(t)) + e and 
l(x,u) < l(t). Denote with V a sequence of control inputs which is feasible for problem V(l(t), x(t)) 
and such that x(N\t) — x. Then, the cost associated to V is: 

JV-l 

j(x(t), v) = J2 '( 4 01*). HM) + «). 

3=0 

where x(j\t), j €E [0, N] is the state trajectory obtained by applying the sequence V . Thus, the difference 
J(x(t),V) — J(x(t),V) is given by: 

J{x{t),V) - J(x(t),V) = /3[l(x(t)) - l(x,u)} + 

N t\mj\t)Mm)-mmMm% (16) 

3=0 

and, by exploiting Assumption^ it holds: 

J(x(t),V) - J(x(t), V) < ~/3e + 77, (17) 
with (see the Appendix for a complete proof) 

^ = ^(e«/" j) (sII---h)) > ' (is) 

where a^\a) = a/(a/(. . .a/(a) . . .)) and a^(a) = a. Thus, by setting /3(e) = rj/e and f3 > /3(e), the 
v . ' _ _ 

% times 

inequality 

J(x(t),V) > J(x(t),V) (19) 
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is obtained. Note that, due to the compactness o/U, the value of n is finite, hence also /3(e) is finite. 
Now assume, with the purpose of contradiction, that l(x*(N\t),v*(N\t)) > l(x(t)) + e. Then, inequal- 
ity (|19p would hold true with V = V*(l(t),x(t)), meaning that the cost associated to V*(l(t),x(t)) 
would be higher than the one associated to V. However, this cannot happen, since, by Assumption 
V*(j(t),x(t)) is such that J(x{t),V* (l(t), x(t))) = J* (l(t), x(t)) < J(x{t),V) for any V which is 
feasible for problem V(l(t),x(t)). Hence, the inequality (| 1 5 [) must hold. □ 

Proposition [5] provides an indication on how to tune the parameter j3 in the higher this value, 
the closer the terminal stage cost is to the best one achievable starting from the current state x(t). 
This property induces a result pertaining to the performance achieved by using Algorithm 2 when 
the initial state x(0) belongs to the feasibility set F s . Before stating such a result, we consider the 
following assumption for tracking MPC schemes. 

Assumption 6 (Stage cost in tracking MPC) 

In tracking MPC, the stage cost I enjoys the following properties: 



I) (global minimum) 



II) (lower bound) 



l(x,u) > 0, V(x,u) € F s x U\ {(x s ,u s )} , , 

l(x s ,u s )=0 [M) 



Ojdlfou) - (x s ,u s )\\) < l(x,u), V(x,u) eFxU. K ' 

Note that Assumption [6] is typically satisfied by the stage cost functions used in tracking MPC, 
like ©. 

Theorem 2 Let Assumptions [7][5| hold, let a value of e > be chosen, and let (3 > /3(e). For any 
x(0) € T s , apply Algorithm 2. Then, the following properties hold: 

a) (sub-optimality of the terminal stage cost) 

l(x*(N\t),v*(N\t))-l(x s ,u s ) <e,Vt>0, (22) 

b) (tracking MPC) if Assumption^ also holds, then: 

\\(x*(N\t),v*(N\t))-(x s ,u s )\\<a- 1 (e),yt>0, (23) 

c) (economic MPC) the asymptotic average performance obtained by control law k is bounded as: 

Joo{n)<l{x s ,u s ) + e. (24) 

Proof 4 a) According to Proposition^ if x(0) G J- s then x s G X(x(0), N). Moreover, by Defi- 
nitions Q] and if x s G X(x(0),N) then l(x(0)) = l(x s ,u s ). Therefore, by Proposition^ we have 
l(x*(N\0),v*(N\0)) — l(x s ,u s ) < e. The use of Algorithm 2 and constraint ( |9g| ) force the values 
l(x*(N\t),v*(N\t)) to be not increasing with t, thus the inequality l(x*(N\t), v*(N\t)) — l(x s ,u s ) < e 
holds true for all t > 0. 

b) From (|22p . under Assumption® I) we have 

l{x*{N\t),v*{N\t)) - l(x s ,u s ) 
= l(x*(N\t),v*(N\t)) < e, Mt > 0. 

Then, by Assumption® II) it holds \\(x*(N\t),v*(N\t)) - (a; s ,u s )|| < ^(e), Vi > 0. 

c ) The proof of this claim follows that of Theorem 1 in [2], with little modifications, and it is reported 
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here for the sake of completeness. First of all, note that, for any t > 0, it holds: 

JV-l 

J*(l(t),x(t))=l(x(t),u(t)) + £ l(x*(j\t),v*(j\t)) + 
pl(x*(N\t),v*(N\t)); 

N-l 

j*(i(t+i), x (t+i))< y: i(x*(j\t),v*(j\t)) 

+l(x*(N\t),v*(N\t)) + j3l(x*(N\t),v*{N\t)), 

thus, under Assumptions]]]^ by using (|22|) . for any x(0) G J 75 it holds: 

J*(l(t + l),x(t + 1)) - J*Q(t), x(t)) 
< l(x s ,u s ) + e-l(x(t),u(t)) 

E J*(T(t+i),x(t+i))-j*(T(t),x(i)) 



lim inf 



t=0 



r- 



T+l 



22 l(x s ,u s ) + e-l(x(t),u(t)) 

< lim inf^_ 



E «(*(*).«(*)) 



t = 



= l(x s , u s ) + e — lim sup ; — , . 

At the same time, due to Assumption]^ 

E J*(7(t+l),x(t+l))-J*(T(i),:c(t)) 

lim inf — 

= Um inf ./'(I(r + i),,(T+i))-j-(I(o),x(o)) 

T->-oo _ i+1 

lim inf - J '^W) = 



T->oo 



T+l 



hence 



E *(*(*),«(*)) 

lim sup — — — = Joo(k) < /(a; 5 , w s ) + e 

T->oo J+l 

T/izs bound establishes the result. □ 

According to Theorem [2]-a) , with a sufficiently high value of /3, for any x(0) inside the feasibility set 
of problem V s , the generalized terminal stage cost is always at most e-suboptimal with respect to 
the one related to the optimal pair (x s ,u s ) (j4|). Theorem [2]-b) implies that it is possible to force 
the generalized terminal state-input pair to be arbitrarily close, as e — > 0, to the desired one, for any 
x(0) G J- s . As a consequence, the convergence and stability properties of tracking MPC schemes, with 
a fixed terminal state constraint, can be extended to the case of generalized terminal state constraint, 
by considering an arbitrarily small neighborhood of the desired set point (x s ,u s ). Finally, Theorem 
[U-c) states that the MPC scheme with generalized terminal state constraint achieves an asymptotic 
average performance which is better than that of the optimal fixed point (x s , u s ), plus an arbitrarily 
small tolerance e. 

Remark 6 Proposition]^ provides only a sufficient condition on (3, thus it is often conservative: as a 
matter of fact, with a reasonably high value of (3 the performance of the MPC scheme with generalized 
terminal state constraint, described in Theorem matches those obtained with a fixed, optimally 
chosen terminal state constraint over all the set J- s . This aspect is highlighted in the examples of 
Section]^ 

Practically speaking, Theorem [2] states that the control law k(x) can achieve performance and 
stability properties that are arbitrarily close to those of k s (x), for any x(Q) G J- s . For state values 
a;(0) G J-\J- S , such a comparison does not make sense, since the control law k s is not defined outside the 
set F s , and the optimal fixed point x s is not reachable (see Proposition^ . We now focus our attention 
on initial state values x(0) G J-\J- S , and we present a modified algorithm that guarantees that the 
resulting MPC law enjoys the properties of Theorem^ under the following additional assumption. 
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Assumption 7 (Sequences of steady states with decreasing stage cost) 

For some (eventually very small) Z > there exists a minimal prediction horizon N_ € N such that, 
for any fixed point (S,h) £ Jx U : f(x,u) = x, there exists at least one state-input pair (x,u) with 
the following properties: 

I) f(x, u) = x; 

II) x £ X(x,N); 

III) l(x, u) < max (l(x s , u s ), l(x, u) — e). 

The practical meaning of Assumption [7J is the following: for any fixed point (x, u) in the set JxU. 
there exists another fixed point (x,u) for the dynamics (Q]) (property I)), belonging to the set of 
reachable fixed points X(x, N_) (property II)). The value of the stage cost l(x,u) is either equal to 
the minimal one among all fixed points, l(x s ,u s ), or it is strictly lower, at least by e, than l(x,u) 
(property III)). We note that, if Assumption [1] holds, Assumption [JJ is clearly satisfied at least for any 
fixed point (x,u) £ J s x U, with N_ = N and (x, u) = (x s , u s ). 

The modified MPC algorithm with generalized terminal state constraint is given below. 

Algorithm 3 (Modified RH control with generalized terminal state constraint) 

1. (initialization) Select an arbitrarily small value of e > such that 2e < e, and select (3 > /3(e). 
Given x(0) € J- , choose a value 1(0) such that (7(0), x(0)) € S and let t = 0. Solve the FHOCP 
V(l(0), x(0)); let V*(l(0), x(0)) be a solution. Apply to the system the control input it(0) = 
u*(0|0). Set t = l; 

2. set l{t) — C(J(t ~~ l)j a; (i — 1)); solve the FHOCP V(l(t), x(t)) by initializing the solver A with 
V = {v*(l\t- 1), . . .,v*(N\t- l),v*(N\t- 1)}; let V*(l(t),x(t)) be a solution; 

3. ifl(x*(N\t),v*(N\t)) >l(t)-e and 

l(x*(N\t),v*(N\t)) > l(x s ,u s )+e, then set V*(J(t),x(t)) = V and, consequently, 
(x*(N\t),v*(N\t)) = (x* Nlt _ 1 ,v* N]t _ 1 ); 

4- apply the control input u(t) = n(l(t), x(t)); 
5. set t = t + 1 and go to 2). 

Remark 7 Note that functions n(l(t), x(t)) and ((l(t),x(t)) used in Algorithm 3 are still the ones 
defined in (|13[) . however the values ofv*(0\t), x*(N\t) and v*(N\t), needed for their evaluation, can 
be either the ones corresponding to the solution of problem V(l(t),x(t)), or the ones corresponding 
to the control sequence V, depending on whether the condition l(x*(N\t),v*(N\t)) > l(t) — e and 
l(x*(N\t), v*(N\t)) > l(x s ,u s ) + e is detected at step (3) of the algorithm. In virtue of Theorem\^-a), 
such a condition may hold true only when x(t) ^ !F S , thus Algorithms 2 and 3 might behave differently 
only outside the feasibility set J- s . 

Remark 8 At step (3) of Algorithm 3, we use the tail of the previously computed optimal control 
sequence just for the sake of simplicity. One other option could be, at any time step t such that the 
condition l(x*(N\t),v*(N\t)) > l(t) - e and l(x*(N\t),v*(N\t)) > l(x s ,u s ) + e is detected, to use 
another auxiliary control sequence, computed by solving a tracking optimization problem to reach the 
terminal state (x*(N\t — 1), v*(N\t — 1)) in minimum time. 

The next Theorem shows that the use of Algorithm 3 produces a sequence of terminal stage costs 
{l(x* (N\t), v*(N\t))}^ that converges in finite time, within the arbitrarily small tolerance e, to the 
optimal value l(x s ,u s ). 

Theorem 3 Let AssumptionsUS^and^hold, and consider the closed-loop system obtained by applying 
Algorithm 3 with N > N_. Then, for any value of x(0) € J- there exists a finite number of time steps 
T(x(0)) such that: 

l{x*(N\T(x(0))),v*{N\T(x(0)))) <l(x s ,u s ) + e (25) 
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Proof 5 Consider any x(0) G J 7 . 7/sc(0) G T s ^then by Theorem\^a) we have l(x*(N\0),v*(N\0)) < 
l(x s , u s ) + e, Vf > 0, hence (|25p afco ZioZrfs im'f/i T(x(0)) = 0. If x(0) G J-\J- S , then an optimal terminal 
state-input pair, (x*(iV|0), u*(iV|0)), is computed, and the control input u(0) = k(7(0), x(0)) = i>*(0|0) 
zs applied to the system. Consider now a generic time step t > 0: assuming that l(x*(N\t), v*(N\t)) > 
l(x s , u s ) + e, the following two cases may occur. 

(a) if l(x*(N\t),V*(N\t)) < l(x(t)) - e, then, considering that l(x(t)) = 0(t - l),x(t - 1)) = 
l(x*(N\t — l),v*(N\t — 1)), the optimal terminal stage cost decreases at least by the quantity e: 

l(x*(N\t),v*(N\t))-l{x*(N\t-l),v*{N\t-l)) < -e. 



(b) ifl(x*(N\t),v*(N\t)) > l(x(t)) -e, then at step (3) of Algorithm 3 the solution V*(l(t), x(t)) of 
V(l(t), x(t)) is replaced by the tail of the previously computed optimal solution. Let us denote 
with t the last time step at which the solution V*(1(t),x(t)) of V(J(t),x(t)) was retained. 
Therefore, the control input at time t is u(t) = v*(t — t\t), and the trajectory of the system 
evolves according to the optimal one predicted at time step r. The same procedure is carried 
out as long as case (b) holds true, until the state x(t + N) = x*(N\t) is eventually reached, 
which happens at most in N time steps. In virtue of Assumption^ since N > N_, once the fixed 
point (x(t + TV), u(t + N)) = (x* (N\t),v* (N\t)) has been reached, there exists at least one fixed 
point (x,u), such that x G X(x(t+N),N) andl(x,u) < max (l(x s , u s ), l(x*(N\r), v*(N\t)) - e). 
Hence, by Definition^ we have 1(x{t + N)) < l(x, u) < max (l(x s ,u s ), l(x*(N\r), v*(N\t)) — e). 
Now, ifm&x(l(x s ,u s ),l(x*(N\T),v*(N\T))-l) = l(x s ,u s ), then 1(x(t + N)) = l(x s ,u s ) and, by 
Proposition^ we have 1(x*(N\t + N),v*(N\t + N)) < l(x s ,u s ) + e, thus the result (|3SJ> holds 
true. If, on the contrary, max (l(x s , u s ), 1(x*(N\t), v*(N\t)) - e) = 1(x*(N\t),v* (N\t))-I, then 
1{x{t + N)) < 1(x*(N\t), v*(N\t)) — e and, again by Proposition^ we have 
1(x*(N\t + N),v*(N\t + N)) < 1_{x{t + N)) + e < l(x*(N\r), v*(N\t)) -e + e. Since 2e < e, the 
inequality I {x*(N\t + N),v*(N\t + N)) < 1(x*(N\t),v*(N\t)) - e holds true, i.e. after at most 
N time steps, the terminal stage cost decreases: 
1(x*(N\t + N), v*(N\t + N)) — 1(x*(N\t),v*(N\t)) < -e 

Summing up, while either cases (a) or (b) occur, i.e. as long as condition l(x*(N\t),v*(N\t)) > 
l(x s ,u s ) + e holds true, the quantity l(x*(N\t + N),v*(N\t + N)) generally decreases (not strictly) with 
t. In particular, the decrease is guaranteed to be always at least equal to —e, and to take place at most 
every N time steps: 

Vj > 1, 

l{x*(N\jN),v*(N\jN)) (26) 
- l(x*(N\(j-l)N),v*(N\(j-l)N))<-e. 

Moreover, due to Assumption^ we have: 

l(x*(N\0),v*(N\0))-l(x s ,u s ) 

< <Xl(\\(x* N \ ,V* Nl0 ) ~ (X S ,U S )\\). V 1 ) 



Equations (|26p - (|27[) lead to the following result: 

l(x*(N\jN),v*{N\jN))-l{x s ,u s ) 
< ai{\\{x*{N\Q),v*{N\G))-{x^u s )\\)-je, 

hence when j > ^(U"'Wo),v'(N\o))-(x;n-)\\) the condUlon l( x *(N\jN),v*(N\jN)) < l(x s ,u s ) + e is 
guaranteed to be satisfied. Therefore, we have 

l(x* (N\T(x(0))),v*(N\T(x(0)))) < l(x s ,u s ) + e, 

where 

T{x{0)) = Ar ^(ll(^(^|o),^(^|Q))-(^,^)ll) ^ 

e 

Note that the pair (x*(N\0),v*(N\0)) is a function of the initial state x(0) only, and thus also the 
quantity T(x(0)) is. □ 
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Remark 9 According to Theorem under the considered Assumptions, for any initial state x(0) 
inside the feasibility set T , by using Algorithm 3 the stage cost of the terminal state-input pair converges 
to a value that is arbitrarily close to the optimal one, l(x s ,u s ), after at most a finite number T(x(0)) 
of time steps. Then, it can be noted that all the properties of Theorem^ hold true also if Algorithm 3 
is used, for all time steps t > T(x(0)). 



5 Examples 

5.1 Linear-quadratic tracking MPC 

We consider the following linear system: 



x(t + l) 



1 1 
1 



x{t) 



1 -1 
-1 1 



u(t), 



with state constraint set X = {x 6 R 2 : \\x\\oo < 10} and input constraint set U = {u E M 2 : ||u||oo < 
2}. The control problem is to regulate the state to the origin, hence a tracking MPC approach is used, 
with the following convex stage cost: 

l(x, u) = \\x\\ 2 + ||u||2, 

which satisfies Assumptions [5] and [6] with the Euclidean norm u) || 2 and a t (a) — a. Assumption 
Q] is also satisfied for N > 2, since the system is controllable and the state and input constraints 
contain the origin in their interiors. Assumption [3] is satisfied, considering the Euclidean norm, with 
ctf(a) — 7/ a, 7/ = ||[AS]|| 2 = 2.14 and ai(a) — 7; a, 7; = ||VZ||2 = \/2, where VI is the gradient of 
I. Assumption |4] holds, too, since problem V is convex, due to the convexity of the stage cost and 
constraints, and the linearity of the model. The set of all admissible fixed points for this system is 
{(x, u) G XxU : X2 = 0, u\ = U2}. Since the system is linear and controllable and the state and input 
constraints are convex, it can be shown that, if N > 2 (i.e. the state dimension), for any given fixed 
point x the set of reachable fixed points X(x, N) (fTO)) is convex. Then, due to the convexity of the 
stage cost and of the feasibility set, with a sufficiently small e, for any given fixed point (x, u) £ T x U 
there exists always a reachable fixed point (x, u) g X (x, N) x U, such that l(x, u) < max(0, l(x, u) — e). 
Therefore, Assumption [7] is also valid. 

We first compare the feasibility sets T s and J 7 , obtained with different values of the prediction horizon 
N. Since the system dynamics and the constraints are linear, and the state constraints define a 
compact set, both F s and T are polytopes. The result of the comparison, in Fig. [TJ clearly shows 
that the use of the generalized terminal state constraint yields an enlarged feasibility set, given the 
same horizon N. In particular, the feasibility set T obtained with N = 2 is already quite large, and 
the one obtained with N = 3 corresponds to the maximal region of attraction for the origin, under the 
considered state and input constraints (see Fig[TJ (a)). A similar result can be obtained with the fixed 
terminal state constraint, but only with N > 7 (see Fig[TJ (b)). More in details, the feasibility set T s , 
obtained with a fixed terminal state constraint, corresponds to the TV-step null controllable region of 
the system, subject to the considered state and input constraints, while, thanks to the generalized 
terminal state constraint, the feasibility set T corresponds to the union of all of the TV-step null 
controllable regions of the system, obtained by translating the origin to all of the feasible fixed points, 
again under the considered state and input constraints, hence it is larger than J- s . This aspect is even 
more evident with a larger state constraint set X: in Fig. [21 the case X = {x E M. 2 : \\x\\oo < 100} is 
shown, with N = 4 for both problems V s and V . It can be clearly noted that the set JF is equal, in this 
case, to the union of sets F s centered at all state values {x : X2 =0}, i.e. all fixed points for the system, 
limited only by the state constraint set X. Indeed, in the absence of state constraints, the feasibility 
set J- would be unbounded. This example clearly shows that the proposed generalized terminal state 
constraint is able to remove the main drawback of using a fixed terminal state constraint, i.e. the 
need to use long prediction horizons in order to have a sufficiently large feasibility set, and it may 
also outperform, as far as the feasibility set is concerned, the two-mode approaches, in which it is 
still required for the state to reach in finite time a terminal set, which is positively invariant under a 
terminal control policy. 
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Figure 1: Linear quadratic tracking MPC. Feasibility sets (a) T and (b) T s obtained with different 



prediction horizons N and state constraint set X = {x£ 



< 10}. 
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Figure 2: Linear quadratic tracking MPC. Feasibility sets T (gray) and T s (white) obtained with 
prediction horizon N — A and state constraint set X = {x £ M. 2 : ||x||oo < 100}. 
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Considering the latter constraint set, we apply Algorithm 3 with a prediction horizon N — 4. We 
obtain the value r\ — 150.08 from (fT5|) . and we select e = 0.1, hence the value /3(e) = 1500.8. According 
to Proposition [5J we choose (3 = 1550. Fig. [3] shows the closed-loop trajectory obtained from the 
initial condition x(0) = [—100, 15] T (solid line), together with the trajectories predicted at each time 
step (dashed) and the related terminal states (circles). According to Theorem [3J the terminal stage 




Figure 3: Linear quadratic tracking MPC. State trajectory obtained with the generalized terminal 
state constraint (solid), predicted trajectories (dashed), and predicted terminal states (circles). Initial 
condition x(0) = [-100, 15] T . 

cost converges to zero in finite time, and specifically in 4 steps, as shown in Fig. 2] (solid line), and, 
accordingly, the terminal state x*(N\t) converges to the origin (see Fig. [3]), as stated in Theorem[5]-b). 
We note that the same result is obtained already with (3 = 50, thus showing the conservativeness of the 
lower bound /3(e), as anticipated in Remark [6] A slower convergence is achieved with smaller values 



2500- 




2 4 6 8 10 12 14 16 
t 



Figure 4: Linear quadratic tracking MPC. Course of the terminal stage cost l(x*(N\t), v*(N\t)) ob- 
tained with (3 = 1550 (solid) and with (3 = 1, 0.5, 0.3 and 0.1 (dashed). 

of (3 (see Fig. 01 dashed lines). Finally, we also note that point (3) of Algorithm 3 is never applied 
in this example, i.e. at each time step the newly computed value of l(x*(N\t), v*(N\t)) improves the 
previous one, l(xN\ t -iT v N\t-i)i more than the selected value e, making Algorithm 3 equivalent to 
Algorithm 2. 
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5.2 Economic MPC for an isothermal CSTR 



We consider the model of an isothermal continuous stirring tank reactor (CSTR) from [ID] , with a 
single first-order reaction: 

C ^ D. 

The state variables are the molar concentrations of C and D, indicated as x\ and X2, respectively, 
and the input u is the flow through the reactor, in 1/min. The continuous-time dynamical equations 
of the model are: 

±i = — (ccf - a?i) ^ Kxi 
V S 

± 2 = -rr( c Df - Xi) + k r Xi, 

VR 

where Vr = 10 1 is the volume of the reactor, cq/ — lmol/1 and cuf — Omol/1 are the feed con- 
centrations of C and D, and k r = 1.2 l/(mol min) is the rate constant. We restrict our attention to 
Xi,x% € [0, 1] mol/1. The input constraint set is U = {u € K : < u < 20}. The control objective is 
to minimize the following economic cost functional: 

l(x, u) = 30 — (2 u X2 — -u), 

which attains its minimum l(x, u) = for u = 20 and {x : x-i — 1}. The best steady state for 

( \ 5 

the system, in terms of stage cost /, is (x s ,u s ) = [ 

l(x s ,u s ) = 24. The control sampling time is equal to 0.5 min. We use the generalized terminal state 
constraint with N = 12. Fig. [5] shows the course of the states of the closed-loop system, starting 
from the initial condition a;(0) = [1,0. 1] T . The terminal stage cost, reported in Fig. [6] with different 



12 ] , and the corresponding cost is 



l 




i.i 1 — 1 — 1 — 1 — 1 — 1 — ' — 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 — 1 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 
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Figure 5: Isothermal CSTR. Course of the state variables x±(t) (solid) and a^fi) (dashed) obtained 
with the generalized terminal state constraint. Initial condition: x(0) — [1,0. 1] T . 

values of /3, rapidly converges to the optimal value, showing that the performance obtained with the 
generalized terminal state constraint are equivalent, in this case, to those achieved with an optimally- 
chosen fixed terminal state constraint. As expected from Proposition [2] the speed of convergence 
is higher with higher values of (3 (see Fig. [5]). After a first transient, a periodic state trajectory is 
obtained (see Fig. EJ, cycling between the values x = [0.57, 0.43] T mol/1 and x = [0.30,0.69] T mol/1, 
and the corresponding input jumps between its extreme values, 1/min and 20 1/min, respectively. 
The corresponding asymptotic average cost Joo is equal to 21.14, i.e. better than the one associated 
to the optimal steady state (x s ,u s ), as expected from the result (see [2]) and Theorem [2] 
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Figure 6: Isothermal CSTR. Course of the terminal stage cost l(x*(N\t), v*(N\t)) obtained with 
j3 = 10 (solid), 1 (dashed), 0.1 (dot), and 0.01 (dash-dot). Initial condition: x(0) = [1,0.1] T . 



5.3 Tracking MPC of an inverted pendulum system 

We consider the equations of motion of a pendulum written in normalized variables (see [5]): 



xi(t) = x 2 (t) 

X2(t) = sin (xi(t)) — u(t) cos (xi(t)) 



(28) 



where the input constraint set U is: 



V = {ue 



< 0.5} 



The state variables are the pendulum angular position x\ (modulo 27r) and angular speed X2- We 
want to track the unstable set point [xi,X2 1 u] T — [0,0, 0] T , starting from the downright position 
[x\, X2,u] T — [tt, 0, 0] T . The following discrete time model to be used in the nominal MPC design has 



(29) 



been obtained by forward difference approximation of 

Xl,t+1 = %l,t + T s X2,t 

X2,t+i = x 2 ,t + T s (sin (xi, t ) - u t cos (xi, t )) 
with sampling time T s — 0.05 s. We choose a stage cost of the following form: 



sin (xi/2) 
x 2 



+ u 2 R, 



225 
1 

the system is given by: 



with Q 



l{x, u) = [sin {xi/2) , x 2 ] Q 
and R = 1, and we choose (3 = 100. The set of all admissible fixed points for 



{(x, u) : sin (x\) — ucos (xi) — 0, X2 — 0, |u| < 0.5} . 



and it is depicted in Fig. [7] (solid lines) . It can be noted that such a set is not connected, in particular 
its projection on the state space is given by: 

{x : xi £ [— arctan(0.5), arctan(0.5)] U [tt — arctan(0.5), tt + arctan(0.5)]; x 2 = 0} . 

In this situation, the use of a generalized terminal state constraint might not be able to drive the 
system state to the target x s = 0, starting from a steady state x such that x\ G [tt — arctan(0.5), tt + 
arctan(0.5)] , unless a sufficiently large horizon N is chosen, as stated in our Assumption!?) As an exam- 
ple, Fig. [7J(gray line) shows the trajectory obtained with N = 60, starting from the steady state/input 
[xi, X2, u] T — [tt, 0, 0] T . It can be noted that the obtained trajectory approaches a limit cycle, confined 
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Figure 7: Inverted pendulum example: set of admissible steady states (solid lines), closed- loop tra- 
jectory obtained with horizon N — 60 (gray lines) starting from a;(0) = [tt, 0] t and state trajectory 
achievable in N = 141 steps from the initial steady state x = [tt — arctan (.5), 0] T and input u = —0.5. 

in a set from which an admissible steady state x : x\ £ [tt — arctan(0.5), tt + arctan(0.5)] is reachable 
within the considered horizon, yet an admissible steady state x : x± £ [— arctan(0.5), arctan(0.5)] is 
not reachable. In line with our result of Theorem^ the average cost associated to such a limit cycle 
is equal to 195.89, while the best reachable steady state has an associated cost of 213.33. 
A value of N that satisfies Assumption [7] is TV = 141, as shown in Fig. [Jj (dash-dot line) by the 
related predicted trajectory starting from the initial steady state x = [tt — arctan (.5), 0] T and input 
u = —0.5. However, note that since Assumption [Jj is only sufficient for Theorem[3]to hold, also lower 
values of N might give the desired results. In particular, we show here the results with N = 100. 
Fig. HJa) shows the obtained time courses of the state and input variables: the controller is able to 



(a) (b) 




2 4 6 8 10 12 14 2 4 6 8 10 12 

time (s) time (s) 



Figure 8: Inverted pendulum example: time courses of the states x\(t), X2(t) and input u(t) obtained 
with (a) the generalized terminal state constraint with N — 100 and (b) a fixed, optimally chosen 
terminal state constraint with TV = 200. 

swing up the pendulum in about 12.5 s. The state trajectory in the phase plane is depicted in Fig. 
[Ha) (solid black line), together with the trajectories predicted at each time step and the correspond- 
ing terminal steady states. In particular, it can be noted how the sequence of terminal state-input 
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x (rad) " " " " ~" 1 2 3 4 5 6 7 8 9 10 11 12 13 14 

1 time (s) 

Figure 9: Inverted pendulum example: (a) closed loop state trajectory obtained with the generalized 
terminal state constraint (solid black line), with N = 100, and trajectories predicted at each time 
step (gray dashed lines). Predicted terminal states are marked with '*'. Dash-dot black line: closed 
loop state trajectory obtained with a fixed terminal state constraint, and N = 200. (b) Course of the 
stage cost associated to the terminal state input pair, l(x*(N\i),u*(N\t)). 

pairs, (x*(N\t),u*(N\t)), is approximately equal to (3.60,0) for t < 3 s, then it jumps to (2.67,0) for 
t £ [3s, 6.6 s and finally converges to the target steady state, after about time 132 steps (i.e. 6.6s). 
These numerical values depend on the initial state, on the input constraints and on the chosen stage 
cost function and prediction horizon. Fig. IH^b) shows the same result in terms of terminal cost value, 
l(x*(N\t),u*(N\t)). 

We also applied a tracking MPC law with a fixed terminal state constraint, i.e. x(N\t) = 0. With 
the considered input constraint, the corresponding FHOCP ([3]) results to be unfeasible for horizons 
TV < 200, i.e. twice the one used with the generalized terminal state constraint. The trajectories 
obtained with the fixed terminal state constraint are shown in Figs. [9(a) (dash-dot black line) and 
[8(b): the controller is able to swing up the pendulum in about lis. Therefore, this example confirms 
that 1. given the same prediction horizon, the use of a generalized terminal state constraint can 
give a feasibility set which is larger than that obtained with a fixed, optimally-chosen terminal state 
constraint, and 2. that the performance obtained with the generalized terminal state constraint, in 
terms of swing-up time, are very close to those achieved with a fixed terminal state constraint, but 
the required prediction horizon (i.e. computational effort) is much shorter. 

6 Conclusions 

We investigated a generalized terminal state constraint for Model Predictive Control and proved 
that, under reasonable assumptions, the resulting closed-loop system has performance similar to those 
of MPC schemes with a fixed, optimally chosen terminal state constraint, while enjoying a larger 
feasibility set and lower computational complexity. These features have been highlighted through 
three examples, considering tracking MPC and economic MPC, and both linear and nonlinear systems. 

Appendix 

Proof of equations (fT7| - (fT8|) . Consider equation (fT6|) . The term (3\l(x(t)) — l(x,u)] is less than — fje 
because j3 > and the pair (x,u) is such that l(x,u) > l(x(t)) + e. The terms [l(x(i\t),v(i\t)) — 
l(x(i\t), v(i\t))], i £ [0,N — 1] can be bounded on the basis of Assumption [31 as follows: 
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case: i = 0; 

i(z(0|f),«(0|t)) - l{x{Q\t),v{0\t)) < \l{x(0\t),v(0\t)) - l(x{0\t),v{0\t))\ < cu\\v(Q\t) - v(0\t)\\) 

CcLSC I — 1 ' 

I(5(l|t),«(i|t))-I(«(l|*),*(l|*)) < |/(5(l|*),17(l|t))-i(*(l|t) s «(l|*))| 

< a,(||(S(l|t),U(l|*)) - (x(l\t),v(l\t))\\) < a,(||3f(l|t) - x(l\t)\\ + \\v(l\t) - v(l\t)\\) 

< at(a f Q\v(p\t) ~ *(0|i)ID) + W) - 

=> Z(x(l|t),*J(l|i)) - J(f (l|t),f)(l|t)) < ai(«/(ll^o|* ~ OoitlD) + aiQHMt) - «(1|*)||) 

= «, ^jC o «?~^(ll^ai*)-«0'l*)ll)^ ; 

generic i 

/(x(*|t),1;(*|i)) - Z(x(i|t),«(i|i)) < a ; (± a { f j \\\v{j\t) - v{j\t)\\^j ■ 
Thus, the second term in (|16p can be bounded by 77 = an ( ^ ai* J ^ [ max | j TT — v\\ ] I . □ 

\j=0 1 \v,v£V J I 
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